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ABSTRACT 


The  probability  density  distribution  is  discussed  for 
two  correlated  random  variables  based  on  an  approximation 
to  a  normal  (Gaussian)  law  using  Hermite  polynomials  of 
two  variables. 


INTRODUCTION 


In  many  physical  theories  the  following  problem  must  be  solved: 
Given  a  large  number  of  experimental  values  of  p  quantities 
Xj,...Xp,  which  are  considered  as  random  variables  having  a  known  joint 
probability  density 

f (x.  ... .x  ,  0. ... .0  ) 
v  1  *  p  l  q 

A  a 

how  can  an  estimate  0^  ,  ...0  be  made  of  the  q  unknown  parameters  0j,...0t 
such  that  the  surface 


z  =  f (x,  , . . .x  ,  , . . .$  ) 

v  1  q  1  q 


fits  the  "cloud"  of  the  experimental  points  as  well  as  possible? 

This  problem  has  been  solved  by  several  well-known  methods  during 
the  last  few  decades;  each  of  these  solutions  is  based  on  a  particular 
definition  of  the  measure  of  the  discrepancy  between  the  surface  and  the 
experimental  cloud.  There  has  been  much  discussion  about  the  best  choice 
for  the  measure  of  the  discrepancy  among  the  different  schools.  Neverthe¬ 
less,  it  can  be  said,  that  in  many  physical  problems  estimates  can  now  be 
made  following  one  or  the  other  of  these  methods  which  lead  to  results  of 
great  significance  for  theoretical  physics. 

But  the  situation  in  fluid  dyhamics  is  very  different  if  we  want  to 
study  a  turbulent  flow  statistically.  For  instance,  taking  the  most 
simple  case  of  a  homogeneous  turbulence,  let  us  suppose  that  a  great  number 
of  measurements  have  been  made  of  velocity  components  (U^,  U2,  U^)  and 
(Vj,  1^,  V^)  at  two  different  points  with  a  constant  interval  of  time  be¬ 
tween  the  two  measurements.  No  theory  has  yet  been  established  giving  the 
mathematical  expression  for  the  joint  probability  density 


.1.  *  *  V,V 


U2,  U3,  Vj,  V2,  V3,  9i»«“0q) 

which  these  six  random  variables  must  follow,  and  thus  we  are  confronted 
by  a  much  more  difficult  situation.  Before  making  any  estimate  of  the  un¬ 
known  parameters  we  must  first  guess  which  particular  function 

must  be  used  for  the  joint  probability  density. 

For  this  problem,  statistical  mechanics  is  able  only  to  give  a  few 
hints,  which  are,  however,  of  great  value.  In  the  classical  case  of  a 
mechanical  system  with  a  finite  number  of  degrees  of  freedom,  the  prob¬ 
ability  law,  according  to  J.  W.  Gibbs,  is  given  by  the  canonical  distri¬ 
bution  in  the  phase  space.  It  has  been  shown  that  for  some  continuous 
media1  having  a  countable  number  of  degrees  of  freedom  (for  instance  a 
vibrating  string)  this  result  is  still  true.  It  is  still  the  normal  law 
which  maximizes  the  entropy  (as  defined  in  information  theory) ,  and  thus 
generalizes  the  canonical  distribution  to  the  function  space  describing  the 
phases  of  the  continuous  medium. 

It  seems  natural  to  take  as  phase  space  for  a  homogeneous  turbulent 
flow,  the  function  space  of  all  vector  functions  Uj  (Xj,  X2,  X^) ,  U2  (X^ 
X2>  Xj) ,  U3  (Xj,  X2,  Xj)  which  are  square  integrable  in  any  bounded  domain 
(this  space  is  called  A  in  Reference  2;  the  reason  leading  to  this  choice 
is  discussed  in  Reference  3) .  The  main  feature  of  A  is  that  the  energy  of 
any  finite  portion  of  the  fluid  is  always  finite.  A  point  in  A  is  defined 
by  a  countable  number  of  coordinates  (A  can  be  considered  as  a  direct  sum 
of  a  countably  number  of  Hilbert  spaces).  Is  it  then  possible  to  extend 
the  results  proved  for  the  vibrating  string  to  the  turbulent  flow  of  an 
inviscid  fluid?  The  answer  to  this  question  is  negative.  The  canonical 
distribution  of  Gibbs  and  its  extension  to  some  continuous  media  is 
essentially  based  on  the  hypothesis  that  the  systems  are  conservative;  the 
canonical  distribution  and  its  extensions  correspond  to  a  statistical 
equilibrium. 
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References  are  listed  on  page  23. 
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If  we  assume,  as  we  always  do  in  research  on  turbulence,  that  the 
fluid  is  viscous,  then  there  is  a  constant  dissipation  of  energy.  If 
through  the  maximum  entropy  principle,  we  associate  normal  law  and 
statistical  equilibrium,  we  cannot  expect  that  the  joint  probability 
density  of  the  random  variables  would  be  the  normal  probability  density. 

The  logical  consequence  of  the  preceding  remarks  is  that  not  being 
able  to  take  advantage  of  the  results  linked  with  the  statistical  equilib¬ 
rium,  we  must  start  as  near  as  possible  to  this  case,  i.e.,  as  a  first 
step  we  suppose  that  our  turbulent  flow  is  near  the  statistical  equilib¬ 
rium.  In  mathematical  terms  we  must  make  the  hypothesis  that  the  joint 
probability  density  is  given  by  the  fi-st  terms  of  a  series 


f 

o 


•V 


•v 


+  f,  (x 


1’ 


.x 


■p’  V 


•V 


where  f  is  the  normal  probability  density  and  the  subsequent  terms  f  1 , 
f2  ...  mark  the  difference  between  the  actual  statistical  state  of  the 
system  and  the  statistical  equilibrium. 

Obviously,  the  joint  probability  density  can  be  put  in  the  form 


f0  (X1- 


•V 


P(x 


r 


Where 

f  f 

P=l+-r^  +  -r^+...  (f  >0) 
f  f  v  o  ' 

o  o 

As  in  most  of  the  approximations  made  in  physics,  it  seems  logical 
to  begin  by  assuming  that  P  is  a  polynomial  in  x^,  . ..x^.  Because  of  the 
exponential  form  of  fQ,  the  simplest  way  to  express  this  polynomial  seems 
to  be  to  use  the  Gram-Charlier  series,  based  on  Hermite  polynomials. 

In  the  following  pages  we  will  summarize  the  essential  features  of 
the  theory,  giving  special  attention  to  the  two-dimensional  case,  because 
most  of  the  literature  has  been  confined  to  the  one -dimensional  case. 

For  a  general  exposition  of  the  theory  of  Hermite  polynomials  in  one 
and  several  variables,  see  Reference  4;  for  the  one-dimensional  Gram- 
Charlier  series,  we  refer  to  Cramer,  pp  221-231,  and  Kendall,  pp  145-150,** 


where  further  bibliographical  references  are  given.  We  are  indebted  to 
Dr.  Lieblein,  who  has  pointed  out  to  us  papers,  References  7  and  8,  which 
are  among  the  very  few  devoted  to  the  two-dimensional  case. 

Note  that  the  Gram-Charlier  approximation  of  the  bivariate  normal 
law,  suggested  here,  differs  on  an  essential  point  from  that  used  in  the 
literature  known  to  us. 

As  a  rule,  to  represent  a  function  of  two  variables  f(x,  y) ,  one 
uses  the  polynomials  Hm(x)  Hn()0*  From  a  purely  theoretical  point  of  view 
this  is  perfectly  correct  because  it  is  well  known  that  if  a  sequence 

(x),  ^  (x),...<J>nCx), . 


defines  an  orthonormal  basis  in  the  Hilbert  space  of  functions  of  one 
variable  f(x)  (which  are  square  integrable),  the  sequence  of  the  product 

♦o  ♦<>  Vx) 

constitutes  an  orthonormal  basis  in  the  Hilbert  space  of  functions  of  two 
variables  f(x,y) . 

This  assumption  leads  to  the  following  representation  of  the  bi¬ 
variate  probability  density 

(x-m)2  (y-n)2 

2  2 
2  a  2t 

+a>,  4-ao 

p(x->°  ■  5 BTT  e  [£  Ai,k  v*r*] 

o,  o 


Correct  from  the  point  of  view  of  the  representation  of  a  function 
of  two  variables  by  a  series,  this  development  has  a  major  disadvantage. 
If  we  assume 


4 


and  all 


A.  k  -  0  (J  -  ifi*  -  1) 


the  function 


PQ  (x,y) 


1 

2  IT  0  T 


e 


does  not  represent  the  bivariate  normal  law  in  the  general  case;  it  only 
fits  if  the  two  normal  random  variables  x  and  y  are  independent. 
Consequently,  a  limited  development 


P(x,y) 


1 _ 

2ir  a  x  6 


(x-m)2 

2a2 


j+k=2n 


j+k=0 


H. 

J 


(x-m)  H  (y-n)  “I 
a  k  t  J 


is  a  possible  approximation  of  the  real  probability  density,  but  it  is  not, 
strictly  speaking,  its  Gram-Charlier  approximation. 

To  generalize  the  one-dimensional  Gram-Charlier  approximation  to 
bivariate  probability  density,  we  must  start  from  the  exponential 


1 

2ir  o  t 


exp 


_1 _ 

2(l-r2) 


and  use  polynomials  in  (x,y)  deduced  from  this  general  normal  law  exactly 
as  the  Hr  in  one  variable  is  deduced  from  the  one -dimensional  law.  These 
polynomials  will  reduce  to  products  if,  and  only  if, 

r  «  0. 

Fortunately,  these  polynomials  are  known.  They  were  discovered  by 
Hermite  in  1864  (Reference  4,  p  363).  It  is  precisely  in  the  case  of  two 
variables  that  he  has  made  his  most  original  contribution.  It  is  now 
known  that  in  the  one-variable  case,  Tschebyscheff  was  his  predecessor  by 
4  years.  But  in  the  two-variable  case  Hermite  introduced  two  adjunct 
sequences 

Gm,n  and  Hm,n  ^ 
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of  polynomials,  connected  with  two  positive  definite  quadratic  forms 


2  2 
<J>  (x,y)  =  ax  +2  bxy  +  cy 

>  >  2  < 
a  -  o,  b  -  0  A  =  ac  -  b  -  o 


and  its  adjunct 


*  (5,  n)  =  f-52 


ir  Cn  *  f  »2 

A  A 


These  two  sequences  of  polynomials  are  biorthogonal . 

The  remarkable  properties  of  the  polynomials  G  and  H  very  easily 
lead  to  the  true  extension  of  the  Gram-Charlier  approximation  for  a  bi¬ 
variate  density. 

Not  only  are  the  formulas  for  the  computation  of  the  coefficients 
much  simpler,  but  the  meaning  of  the  development  is  also  changed.  If 
we  suppose  all  the  coefficients  A.  .  =  0  except  A  =1,  then  the  unique 
term  left  coincides  with  the  general  bivariate  law  with  an  arbitrary 
correlation  coefficient  r. 

The  main  contribution  here  is  thus  to  substitute  for  the  usual 

series  a  new  Gram-Charlier  series  based  on  the  polynomials  G  (x,y)  and 
_ _  _ m,n _ 

H  (x,y)  introduced  by  Ch.  Hermite. 
m,n  _ _ _ 

The  computation  of  the  sufficient  condition  for  the  polynomial 
P4(x)  to  be  positive  was  made  by  Mr.  M.  Pine. 

ONE-DIMENSIONAL  GRAM-CHARLIER  SERIES 


The  Hermite  polynomials  in  one  variable  x  are  defined  by  the 


generating  function 


.  h 

hx  -  ■=“  ,  n 

*  2-ET*r"nM 


[1] 


from  where  we  deduce 


Hn  (x)  =  (-l)n 


[2] 
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If  n  is  even. 


and  if  n  is  odd. 


We  have 


"n  '-x)  *  H„W 


H  (-X)  =  -  H  (x) 
n  n 


H 

(x) 

- 

1 

0 

Hi 

(X) 

= 

X 

H2 

(X) 

- 

2 

X  - 

1 

H3 

(X) 

= 

3 

X  - 

3x 

H4 

(X) 

= 

4 

x  - 

6  x2 

+ 

3 

“s 

(X) 

5 

x  - 

10x3 

+ 

15x 

»6 

(X) 

= 

6 

x  - 

15x^ 

+ 

45  x'2  -  15 

H7 

(X) 

= 

7 

x  - 

21x5 

+ 

105x3  -  105x 

H8 

(X) 

0. 

8 

x  - 

28x6 

+ 

210x4  -  420x2  ■*  105 

H9 

(X) 

= 

9 

x  - 

36x7 

+ 

378x5  -  1260x3  +  945x 

H10 

(X) 

- 

10 

X 

45x8 

+ 

630x6  -  3150x4  +  4725x2 

4 

.  * 


-  945 


The  Hermite  polynomials  satisfy  the  following  equations 


hh  dk  H 

l£=nHn-l  '^rIL=  n(n-l)...(n-k+l)  Hn_k 


[3] 


H  (x)  -  x  H  . (x)  +  (n-1)  H  «(x)  =  0 
nv  n-1  v  n-2v 


[3'] 


d2  Hn  d  Hn 

- =  -  x  -  +  n  H  =0 

2  ,  n 


d  x 


dx 


[3"] 
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The  polynomial  Hn(x)  has  n  real  roots. 

We  have  the  pair  of  conjugate  Fourier  transforms 

2  k2 

-xZ  1  -  +“  -ikx  -  — 

e  2  H  (x)  =  7TT  e  (ik)  n  dk  [4] 

n  •'-00 

(x  real) 


n 

(ik)  e 


k; 

2 


T 


*rJ. 


+»  ikx  -  4- 


H  (x)  dx 
n  v 


[5] 


(k  real) 


The  Hermite  polynomials  have  the  fundamental  orthogonality  property 

2 


I 


+00  2 

e  H  (x)  H  (x)  dx  =  2  *  n!  6 

m  n  m,n 


[6] 


which  results,  fo-  the  Fourier  coefficients  A  ,  in 

n 


f(x)  = 


iC 


x 

2 


+  00 

[E  w-j 


m 


and 


f(x)  H. 


n 


(x)  dx 


[8] 


The  right-hand  side  of  Equation  [7]  is  known  as  the  Gram-Charlier  series  of 
f (x) ;  we  will  find  in  Reference  4,  pp  351-355,  sufficient  conditions  for 
the  validity  of  the  representation  of  a  given  function  f(x)  by  this  series. 
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GRAM-CHARLIER  APPROXIMATION  OF  THE  PROBABILITY  DENSITY  FOR  ONE 

RANDOM  VARIABLE 

Let  us  consider  a  random  variable  X  having  moments  of  all  orders. 

We  put 

X  =  m  [9] 


(X  -  m)2  =  o2  [10] 

and  consider  the  moment  centered  at  expectation 


(X  -  m)  =  Mk,  k  =  3,4,.... 


Let  us  assume  that  the  probability  density  of  x  is  given  by: 

(x-m)2 

i  "  ?  2 

P(x)=-7=4=re  Z0  P2n  (x) 


[11] 


[12] 


where  P2n  (x)  is  a  polynomial  of  degree  2n  (if  we  take  a  polynomial  of  odd 
degree  2n  +  1,  we  will  have  the  consequence  that  for  x  -►  or  x  -►  +  », 
depending  on  the  coefficient  of  x2n+*,  the  probability  density  P  (x)  will 
have  very  large  negative  values,  which  is  absurd).  It  is  always  possible 
to  express  any  polynomial  as  a  sum  of  Hermite  polynomials.  Thus  we  can 
write 


P(x) 


A.  H.  (— ) 
3  j  o 


Let  us  note  that  P(x)  is  an  entire  function  of  order  2  of  x. 


[13] 
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From  the  orthogonal  property.  Equation  [6],  we  get  the  remarkably 
simple  expression  for  the  coefficients 


namely, 


A  =  —  .  H 
.i  n  !  n  v  a 


[14] 


A  =  1 
o 


Aj  =  0 


a2  =  0 


[15] 

[16] 
[17] 


A3  =  3  !  3 

o 


[18] 


1  m4 

a4-t.  <t-3> 

a 


[19] 


1  v5  u3 
A5  =  T  !  (T  '  10  3^ 


[20] 


1  ,U6 


A6  *  1  C-f  -  15  -1  ♦  15) 


[21] 


We  can  have  equivalent  expressions  in  terms  of  the  cumulants  in 
place  of  the  centered  moments  yn.  Let  us  remember  that  <Ht)  being  the 
characteristic  function 

i  t  X 

<Kt)  =  e 

the  cumulants  are  defined  by  the  expression 


10 


[22] 


'»•  «*>  -E  >,  ^ 

1 

the  series  being  convergent  in  the  circle  |t|  <  6,  and  where  <f>(t)  >  0. 
We  find 


Thus,  with  this  notation,  we  obtain  the  equivalent  expressions 


A  =  1 
o 

A,  =  0 
A2  s  0 

A3  ■  K3  6  ^ 

A.  =  K.  24  o4 
4  4 


The  expression  of  the  characteristic  function  <J>(t)  corresponding  to 
the  probability  density  P(x)  defined  by  Equation  [13]  can  easily  be  com¬ 
puted.  From  Equation  [5]  we  obtain 


itm 

(iot)n  e 


2  2 
o  t 


+00 


it  x  - 


[23] 


thus 


Kt)  =  e 


imt 


rE 


2n 


A. 

3 


(i  o  t)' 


[24] 


The  simplest  way  to  compute  the  moments  in  terms  of  the  (i.e., 
the  inverse  formula  of  Equation  [14])  is  to  compute  the  coefficients  of  t 
in  the  development  of  the  entire  function  of  t  defined  by  [24]. 
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If  we  assume  that  all  the  Ajj  for  n  -  1  are  equal  to  zero,  the 
probability  density  P(x)  is  simply  the  normal  law. 

The  first  step,  to  use  the  Gram-Charlier  approximation,  would  be 

to  try 


P(x)  =  -1-e 

ilia 


(x-m)2 

2°2  [i  ♦*,  h3  (“>  a4  h4  (*f) 


[25] 


the  coefficients  A^  and  A^  being  computed  from  the  known  moments  y^  and 
y4  by  formulas  [18]  and  [19].  In  order  that  [24]  be  the  exact  expression 
of  the  probability  density  of  x,  all  the  coefficients  Ag,  A&,  A,,... com¬ 
puted  from  the  given  y^  by  [20],  [21]... must  be  zero . 

Let  us  note  that  the  characteristic  function  corresponding  to  [25] 
has  the  expression 


2  /2 


imt  - 


4>(t)  =  e 


A-  (i  at)  +  A.  (1  at) 


] 


[26] 


Thus  <J>(t)  is  an  entire  function  of  order  2. 

All  the  moments  y^,  u^,  y^,  can  easily  be  expressed  in  terms  of  A 
and  A^,  by  computing  (from  Equation  [25])  the  development  of  <J>(t)  in  a 
power  series 


<|>(t)  =  1  -  imt  - 


2  *2 
a  t 


+  y 


(it) 
3  3! 


+  y 


(it) 
4  4! 


One  way  to  test  the  accuracy  of  the  Gram-Charlier  approximation,  limited 
to  a  polynomial  of  order  4,  would  be  as  follows: 

a)  compute  Aj  and  A4  from  the  known  values  of  y^  and  y4  by  formulas 
[18]  and  [19]; 

b)  from  these  values  of  A^  and  A4  compute  the  moments  y^,  y^,  y^  .  . 


A 

Uc 


10  a2 


y 


3 


12 


15  a 


and  then  compare  these  values  with  the  known  values  y,.,  y^,  y^. 

We  could  consider  the  approximation  as  being  good  if  the  dif¬ 
ferences 


A 

Mr 


y6  •  y6 


A 

*7 


are  all  small. 

It  is  interesting  to  note  that  in  that  case  the  skewness  of  P(x)  is 
produced  by  only;  if  =  0,  the  function  P(x)  is  even  in  (x-m) . 

As  we  have  already  pointed  out,  in  order  that  P(x)  might  be  a 
probability  density,  the  polynomial  P2n(x)  must  satisfy  the  condition 

P2n(x)  -  0  [27] 

for  all  real  x;  this  condition  seems  to  have  been  neglected  very  often. 

Let  us  observe  that 


P4(x)  =  1  -  A3  H3  (if)  ♦  A4  H4  (SB) 

is  a  continuous  function  of  A,,  A..  For  A.  =  A.  =  0  we  have 

3’  4  3  4 

P4(*)  =  1 

and  the  condition  is  satisfied.  Thus,  if 

| A3 |  <  6  0  <  A4  <  a 

we  are  sure  that  Equation  [27]  is  satisfied. 

Putting 


i 
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1 


t 


%• 


we  find  the  following  bounds 


b 


a 


-<  V-  48  *y. 


5766  -  2840 


JOINT  PROBABILITY  DENSITY  OF  TWO  RANDOM  VARIABLES 


Let  (X,Y)  be  a  pair  of  random  variables;  suppose  we  know  the 


Yk  =  m  .  .  for  0  -  (j  +  k)  -  j 
J 

We  write  m  and  n  for  m,  and  m  , 

1,0  o,l 

(the  expectations  of  X  and  Y) 


X  =  m  Y  =  n 


From  hk  ^  we  can  compute  the  moments  centered  at  expectation 


(X-m)j  (Y-n)k  =  u  . 

J  »K 


We  write  o  and  x  for  ,  v  _  (the  variances  of  X  and  Y)  and  r 

6  j  O  O  j  fc 

for  Uj  j  (r  being  the  correlation  of  X  and  Y) .  We  always  suppose 


o  >  0  x  >  0 


2  2 
(X-mr  =  o 


2  2 
(Y-m)  =  t 


(X-m)  (Y-n)  =  r  o  x  |r|  -  1 


If  (X,Y)  follows  the  normal  law, 


[28] 


moments 

[29] 


[SO] 


[31] 


[32] 
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Prob 


[x  <  X  <  x  +  dx,  y  <  Y  <  y  +  dy]  = 


p  (x,y)  dxdy  [33] 


,  ,  YT  r  1  ,  A 

p  (x,y)  =  exp  L“  T  ♦  (x_m>  y_n)  J 


where 


4>  ('  /)  = 


(1  -  r  ) 


2 

JL_ 

2 

a 


2r^+I- 

at  2 

T  _ 


[34] 


A  = 


2  2  f.  2. 
a  t  (1  -  r  ) 


To  represent  the  given  m.  .  (or  ji .  .]  as  well  as  possible,  we 

1 » K  1 » K 

will  try  a  probability  law  of  the  type 


,  ,  Va 

P  (x,y)  =  —  exp 


T  ♦  (x  -  m,  y  -  n) 


P  (x,y) 


[35] 


where  P  (x,y)  is  a  suitably  chosen  polynomial.  We  will  give  an  expression 
for  P  (x,y)  in  terms  of  the  Hermite  polynomials  of  two  variables. 

HERMITE  POLYNOMIALS  OF  TWO  VARIABLES 

Consider  a  positive  definite  quadratic  form 

2  2 
<J>  (x,y)  =  ax  +2  bxy  +  cy 

2 

a>0  c>0  A  =  ac  -  b  >0 

and  its  adjunct 

,  r  r  \  c  _2  -  b  _  a  2 

*  (C  n)  =rc  -  2-?  n  *  —  n 


[36] 


[37] 


Putting 


15 


> 


K  =  ax  +  by  n  =  bx  +  cy 


[38] 


we  have  the  identity 


♦  (S,  n)  =  $  (x,  y) 


[39] 


Following  Reference  4,  pp  363-387,  we  define  the  Hermite  polynomials 


Hm,n  and  Gm,n  ^ 


exp 


h  (ax  +  by)  +  k  (bx  *  cy)  -  y  4  (h,k;l  yj  ^  Hmn  (x. 


y) [40] 


exp 


_  -1  +«»  I 

[hx  +  ky  -  7  *  CM)]  £  h  Gm,n  (x’ 


y) 


[41] 


Let  us  recall  some  useful  properties  of  those  polynomials 


Vn  * 


j  <fi  (x,y) 


,m+n 


.  m  . 

3  x  3  y 


-j  <t>  (x,y) 


] 


[42] 


Gm,n  ^  e 


T  *  '^»n)  m  n 
i  jn,n 


3£m  an 


n  L 


^  ({  n) 


] 


[43] 


The  polynomials  Hm  n  and  Gm  n  verify  the  following  conditions: 


- —  H  (x,y)  =  am  H  .  +  bn  H  . 

3  x  m,n  m  -  l,n  m,n  -  1 


[44] 


r —  H  (x,y)  =  bm  H  .  +  cn  H  . 

3  y,  m,n  v  J  m  -  l.rt  m,n  -  1 
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? —  G  (x.y)  =  m  G  , 

3  x  m,n  *7  m  -  1,  n 


- —  G  (x.y)  =  n  G  , 

3  y  m,n  m,  n  -  1 


Hm,n  (x-y)  -  5  Hm  -  1,  n  (x'y)  *  a  <■  '  »  H»  -  2,  n  <x'y>  t46J 
*  bn  Hm  -  l,  n  -  i  (x’y>  '  0 


Hm,n  (x-y)  '  n  Hm,n  -  1  <x-y>  *  bm  Hm  -  1,  n  -  1  (x'y> 


+  c  ("  -  «  Hm,n  -  2  (x->’)  *  0 


Gm,n  (x’y)  -  x  Gm  -  1,  n  (x’y)  *  T  C"  '  1J  Gm  -  2,  n  (x'y)  <47> 

'Tn  Gm  -  1,  n  -  1  ( x,y *  m  0 


Gm,n  (x>y)  -  y  Gm,n  -  1  <x-y)  '  Tm  Gm  -  1,  n  -  1  <x>y> 


*T(n  -  11  Gm,n  -  2  (x-y)  =  0 


The  first  polynomials  n  have  the  following  expressions: 

Ho,o  <x-y)  -  1 

\o  fx,y^  ”  * 

H0>1  (x-y>  *  i 

H2,o  (x-y)  *  ^  -  a 


Hj  j  (x,y)  =  5  n  -  b 


2 

Ho,2  ^x,y'  =  n  "  c 
H3,o  ^x,y^  =  Z5  -  3  a  £ 

H2  j  (x,y)  =  n  -  2b  5  -  a  n 

Hj  2  U,y)  =  Cn2  -  2b  n  -  c  c 

Ho,3  ^x,y-)  =  n3  “  3  c  n 

H.  (x,y)  =  C4  -  6  a  l}  +  3  a2 

4,o 

H3,l  (x»y)  =  S3n  -  3  b  £2  -  3  a  £  n  +  3  a  b 

H2  2  (x,y)  =  £2  n2  -  c  £2  -  2  b  £  n  -  a  n2  +  ac  +  2  b2 

Hj  3  (x,y)  =Cn3-3bn2-3c?n+3cb 
Ho,4  tx’y)  =  n4  -  6  c  +  3  c2 
The  first  polynomials  Gm  n  have  the  following  expressions: 

Go,o  (x-y)  ■  1 
Gl,0  (x-y)  =  x 
G0,i  (x-y)  =  y 
G2,o  (x’y)  ■  ^  -  X 

g!,i  (x-y)  ■ x  y 

go,2  (x-y)  =  y2  - 1 
G3,o  (x,y)  *  1(3  ‘  3  i_x 
G2,l  (x,y)  =I<2y*2'ax''Ay 
Gl,2  (x’y)  *  xy2  *  2^y  -  Ax 


[49] 
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Go,3  (x'y) 


3  _  a 

y  ■  3xy 


G4,o  fx-y) 


gj,i  (x-y) 


X4  -  6-f  X2  ♦  3-£y 
4  A2 


3c 


x  7  -  —  xy  +  — x 


3  b  2  3b  c 
*  2 


^2  2  (x ,y)  - 


G1>3  (x,y) 


2  2  a  2  c  2  4b  ac  _  b‘ 

x  y  -~x  -  Xy  *  ~Kxy'  *  —  *  2~ 5 

*  A 


3  3  a 


-  t  xy  +  t  y 


3  b  2  3ab 
.  2 


Go,4  (X’>° 


4  ,  a  2  ,  a 

y  -  6-j-y  *  3  — 


The  two  sequences  of  polynomials  Hm  n  and  have  the  orthogonality 

property 


VT 

2 


+00 


U  J 


+  0°  1  ,  /•  X 

-  -T  ♦  (x,y) 

e  H  (x,y)  G  (x,y)  dxdy  = 

m,n  ^  ,}  p,q  v  1 


*00  -00 


[50] 


6  6  m!  n! 

m,p  n,q 


This  is  the  most  original  contribution  of  Hermite:  the  H  or  the  G 

6  m,n  m,n 

are  not  orthogonal  to  themselves  but  they  are  biorthogonal ,  i.e.,  they  are 
orthogonal  to  each  other  in  this  very  special  way. 

Let  us  note  that  the  polynomials  n  (x,y)  and  Gm  n  (x,y) 
degenerate  in  a  product  Hm(x)  H^Cy)  if,  and  only  if,  the  coefficient  r  is 
equal  to  zero. 

GRAM-CHARLIER  APPROXIMATION  OF  A  TWO-DIMENSIONAL  PROBABILITY  DENSITY 


Let  us  now  suppose  that  in  the  probability  law  [35]  the  poly¬ 
nomial  P  (x,y)  is  the  sum  of  a  finite  number  of  Hermite  polynomials 
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In  the  quadratic  forms  [36]  and  [37]  we  will  assume 


thus 


b  * 


r 


c  = 


1 


and  [36]  becomes 

<P  (x,y)  =  — — y  (x2  -  2  r  x  y  +  y2) 
1  -  r 


the  adjunct  form  [37]  being 

(€,  n)  =  £2  +  2  r  E,  n  +  n2 


[51] 


[52] 


[53] 


[54] 


the  variables  (5,  n)  and  (x,y)  being  connected  by 

£  =  — ~ — j  (x  -  r  y)  n  =  — — '2  (-  r  x  +  y)  [55] 

1  -  r  1  -  r 

Now  it  is  obvious  that,  when  (x,y)  follows  the  normal  law,  we 

have 

po  (x,y) - i - — -  exp  [-  -i  ♦( 

t# i  2  L  ' 

2  i  o  t  yi  -  r 

where  the  quadratic  form  <p  is  defined  by  [53] . 

Let  us  point  out  again  that  pQ  (x,y)  is  the  probability  for  a 
pair  of  random  variables  X,  Y  which  are  not  necessarily  independent;  it 
applies  even  if  the  correlation  r  of  X,  Y  is  not  zero.  Now  the  principle 
of  our  method  is  to  use  the  two-dimensional  Gram-Charlier  series : 

j  +  k  =  4 

P  (*,y)  =  P0  (*./)[£  A.  k  Hj>k  (if-51).  Of-2)]  [57] 

j  +  k  =  0 

where  H.  ,  (x,y)  are  the  Hermite  polynomials  [48]  a,  b,  c,  n  having  the 
J » x 

values  [51]  and  [55]. 


Because  of  the  orthogonality  property  [50],  we  obviously  have 
the  fundamental  result 


Equations  [49]  give  the  values  of  the  coefficients  A.  ,  in  terms 

J 


of  the  moments  y.  ,  (defined  by  [31]);  we  have 

J,K 


0,0 

1 

> 
►— * 

0 

11 

0 

It 

pH 

O 

< 

0 

2,0  " 

0 

n 

pH 

< 

0 

A  ,  = 
o,2 

0 

V  x  _ 
0,0 

A  - 

y2,l 

"1,2 

3,o 

s  3 

6  a 

A2,l 

2  a2  x 

Al,2 

2  0  t2 

A  .  =  » 

o,3  ,  3 

’  6  x 


A .  =  "  3), 

4,0  24  4  * 

0 


.  3  r) 

i,  1  6  V  3  J  r/’ 

O  T 


A  1  ( ^2 .2  .  ,  2\ 

A2  2  ‘  4  '  2  2  "  "  r 

O  T 


A  1  /yi.3  -  X 

Al,3  ”  6  '  3  "  3  r)’ 

*  a  t 


Ao,4  *  JT(V  -  3  > 

T 
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Of  course,  as  for  the  one-dimensional  case,  to  test  the  fitness 
of  the  approximation,  we  must  compute  A  .  for  j  +  k  =  5  and  see  whether 

j  ,K 

they  are  small. 


etc. 
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